BACKGROUND OF THE INVENTION

In the United States, urogenital infections
with Chlamydia trachomatis are the leading sexually
transmitted disease, with an estimated 10 million new
cases reported each year. The urogenital infections
result in involuntary infertility in 100,000-200,000
women each year. It is estimated that equal or
greater numbers of these infections occur in
European countries. In addition, ocular infection
with C. trachomatis results in blinding trachoma,
which afflicts approximately 500 million individuals
from developing countries. Improved diagnostic
methodologies and vaccines or immunoprophylactic
preventative measures are needed to manage and control
the disease.

Chlamydia trachomatis isolates occur as 15
distinct serovars. Based on serological
relatedness, these 15 serovars have been divided
into three serogroups: B serogroup (serovars B, Ba,
D, E, L1 and L2), intermediate serogroup (serovars
F, G, K and L3), and C serogroup (serovars A, C, H,
I and J). The antigen that confers serovar and
serogroup specificities to chlamydia is the major
outer membrane protein (MOMP), and protective
immunity developed during chlamydial ocular
infection is thought to be directed at serovar and
serogroup MOMP determinants.

The genes encoding the MOMP of the C.
trachomatis serovars A, B, L1 and L2 have been
cloned and sequenced (Pickett et al., FEMS
Microbiol. Lett. 42:185-190 (1987); Stephens et al.,
J. Bacteriol. 168:1277-1282 (1986); Stephens et
al., J. Bacteriol. 3879-3885 (1987); and Baehr et
al., Proc. Natl. Acad. Sci. USA 85:4000-4004
(1988)). Comparative analysis of their amino acid
sequences shows the MOMP genes to be highly conserved
structures that contain four evenly spaced domains
whose sequences vary among the different serovars.
The locations of these nucleotide and amino acid
sequence variable domains (VDs) are:

VDI - nucleotides 256-315 and residues 64-83;
VDII - nucleotides 481-546 and residues 139-160;
VDIII - nucleotides 736-777 and residues 224-237;
VDIV - nucleotides 928-1017 and residues 288-317.
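
The nucleotide and residue coordinates listed above can be cross-checked arithmetically. The following Python sketch is illustrative only. It assumes that each residue corresponds to one three-nucleotide codon and that nucleotide 1 begins codon 1; neither assumption is stated in the text. The names `VD_COORDS`, `codon_offset` and `check` are hypothetical.

```python
# Coordinates copied from the list above:
# (first nucleotide, last nucleotide, first residue, last residue).
VD_COORDS = {
    "VDI": (256, 315, 64, 83),
    "VDII": (481, 546, 139, 160),
    "VDIII": (736, 777, 224, 237),
    "VDIV": (928, 1017, 288, 317),
}


def codon_offset(nt_start, res_start):
    """Codons preceding residue 1, assuming nucleotide 1 starts codon 1."""
    codon_index = (nt_start - 1) // 3 + 1  # 1-based codon containing nt_start
    return codon_index - res_start


def check(coords):
    """Return {domain: (nucleotide length, residue count, codon offset)}."""
    report = {}
    for name, (nt_a, nt_b, res_a, res_b) in coords.items():
        nt_len = nt_b - nt_a + 1
        n_res = res_b - res_a + 1
        # One codon per residue: the nucleotide span must be 3x the residue span.
        assert nt_len == 3 * n_res, name
        report[name] = (nt_len, n_res, codon_offset(nt_a, res_a))
    return report


if __name__ == "__main__":
    for name, row in check(VD_COORDS).items():
        print(name, row)
```

Under these assumptions all four domains pass the codon check and share the same offset of 22 codons between the two numbering schemes. That would be consistent with residues being numbered from a processed protein that lacks a leader peptide, although the text does not say so.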

Epitope mapping has shown that three of the
four VD domains (I, II and IV) contain contiguous
antigenic determinants which elicit the formation of
serovar, subspecies (determinants common to three or
more serovars within a serogroup), serogroup or
species-specific antibodies.

Variable domains I and II, which demonstrate
the greatest amount of inter-serogroup sequence
variation, are the locations of the serovar-specific
determinants. Variable domain IV is the largest of
the MOMP VDs and is located near the C-terminus of
the protein. VDIV is the location of subspecies and
serogroup antigenic determinants, as well as a
highly conserved species-specific antigenic
determinant. Antigenic determinants have not been
mapped to VDIII, the smallest and least variable
domain of the MOMP genes.

Variable domains I, II and IV of the MOMP of
C. trachomatis protrude from the cell surface
towards the external environment, as demonstrated by
their susceptibility to cleavage by trypsin and
their accessibility to antibody binding. Trypsin
cleavage in both VDII and VDIV, but not within VDIV
alone, decreases chlamydial attachment to HeLa cells,
suggesting that these domains, or conformational
MOMP structures that are dependent on the integrity
of these domains, may function as a chlamydial
ligand.

Based on the immunological and biological
relationships among MOMP VD structure and function,
and the fact that the MOMP VDs appear to be the
major variable sequences between serovars, the
present inventors sought to identify the nucleotide
and amino acid sequences of the MOMP VDs I-IV of C.
trachomatis serovars Ba, D, E, F, G, H, I, J, K and
L3, and to confirm those of serovars B, A, C, L1 and
L2.

SUMMARY OF THE INVENTION

The amino acid sequences of the major outer
membrane protein genes from Chlamydia trachomatis
serovars A, B, C, L1 and L2 have been shown to be
conserved with the exception of four variable
domains in which major neutralizing and serotyping
antigenic determinants have been located. Using
oligonucleotide primer extension sequencing of MOMP
mRNA, the nucleotide sequences of the four MOMP VDs
of the remaining 10 C. trachomatis serovars were
determined by the present inventors and the amino
acid sequences deduced therefrom. The sequencing
technique was rapid and required minimal amounts of
total RNA, i.e., only 35 µg of RNA from chlamydial
infected HeLa cells was needed to sequence all 4
variable domains for a given serovar. Use of the
disclosed technique also circumvented the more
standard and laborious approach of molecular cloning
and direct DNA sequencing which had been employed in
the past to determine the sequences of the MOMP genes
of serovars A, B, C, L1 and L2.

As noted above, the nucleotide sequences were
employed to deduce the corresponding amino acid
sequences. Comparative analysis of the amino acid
sequence homology of the four variable domains
permitted the present inventors to separate the
fifteen serovars into three serogroups:

Group 1 - serovars B, Ba, D, E, L1 and L2;
Group 2 - serovars G and F;
Group 3 - serovars A, C, H, I, J, K and L3.

The four variable domains were also analyzed
for immunogenicity based on the hydrophilicity and
charge values of each domain (Baehr et al., supra).
The MOMP VDs with the greatest total hydrophilicity
and charge values were found to be the location of
antigenic determinants recognized by MOMP-specific
monoclonal antibodies (Stephens et al., supra; Baehr
et al., supra).

The nucleotide and amino acid sequences and
hydrophilicity/charge value analyses are
advantageous because they will assist in the
selection of appropriate MOMP antigenic determinants
to be used in the construction of synthetic
peptides, subunits or recombinant chlamydial
vaccines. Examples of such recombinant vaccines
include infectious enteric vectors (avirulent
Salmonella typhimurium and/or Sabin Type I polio
virus) expressing MOMP variable domain sequences as
antigenic chimeras or hybrid molecules.

The present invention will allow the
production of reagents and methodologies applicable
in the development of new diagnostic tests for C.
trachomatis infections and serological tests for
serotyping. Specifically, species or serotype
specific oligopeptides conjugated to an array of
reporter or detector groups could be used in various
conventional non-immunologic assays including
standard hybridization techniques.

Likewise, reagents for use in immunologic or
serologic tests could be prepared based on the
sequences presently disclosed; the conserved VD-IV
peptide sequence TTLNPTIAG would permit species
specific identification, while the non-conserved
sequences would enable one to make serotype
specific identification. It is contemplated by the
present inventors that such reagents would be
suitable for use in standard immunologic procedures
such as ELISA and radioimmunoassay techniques.

The invention will be more fully described in
the detailed description and drawings which follow.

DESCRIPTION OF DRAWINGS

Fig. 1. Location of the MOMP gene sequences
to which synthetic complementary oligonucleotides
were constructed and used for primer extension mRNA
sequencing of MOMP VDs. The MOMP gene sequences
identified by single lined boxes are those to which
synthetic oligonucleotides were constructed. The
double lined boxes identify the four variable domain
regions of B, L2, A, C, and F MOMP genes.

Fig. 2. Sequencing of serovar B and A MOMP
VDII by primer extension of MOMP mRNA. Total RNA
was isolated from HeLa 229 cells infected with
serovars B and A and annealed to kinased
oligonucleotides BFP-2 and CP-2 (Table 1),
respectively. The oligonucleotides were
complementary to the non-coding strand of MOMP gene
sequences 50 nucleotides downstream (3') of VDII. The
RNA templates were transcribed with reverse
transcriptase, and the sequences of the transcribed
DNAs determined by the dideoxynucleotide chain
termination method. Autoradiographs were read to
depict + strand MOMP DNA sequences. The nucleotide
numbers shown for each MOMP are the beginning and
end of the sequence encoding VDII of each protein.
The results shown are representative of sequencing
results obtained for all other serovars' VDs.

Fig. 3. (A, B, C and D) The nucleotide
sequences of MOMP VDs I, II, III and IV,
respectively, of the 15 C. trachomatis serovars.
The boxed regions of the sequences identify each of
the four VDs. Serovars B, F and C MOMP sequences
were used as prototypes for comparative sequence
analysis. The nucleotide positions shown are for
serovar B MOMP. Exact nucleotide positions for
other MOMP VD sequences differ slightly because of
insertions and/or deletions within the VDs.

Fig. 4. (A, B, C and D) The deduced amino
acid sequences of the MOMP VDs 1, 2, 3 and 4,
respectively, of the 15 C. trachomatis serovars.
The boxed regions of the sequences identify each of
the four VDs. Serovars B, F and C MOMP amino acid
sequences were used as prototypes for comparative
sequence analysis.

Fig. 5. C. trachomatis serovar B MOMP
variable domain sequence.

Fig. 6. C. trachomatis serovar Ba MOMP
variable domain sequence.

Fig. 7. C. trachomatis serovar D MOMP
variable domain sequence.

Fig. 8. C. trachomatis serovar E MOMP
variable domain sequence.

Fig. 9. C. trachomatis serovar L1 MOMP
variable domain sequence.

Fig. 10. C. trachomatis serovar L2 MOMP
variable domain sequence.

Fig. 11. C. trachomatis serovar F MOMP
variable domain sequence.

Fig. 12. C. trachomatis serovar G MOMP
variable domain sequence.

Fig. 13. C. trachomatis serovar C MOMP
variable domain sequence.

Fig. 14. C. trachomatis serovar A MOMP
variable domain sequence.

Fig. 15. C. trachomatis serovar H MOMP
variable domain sequence.

Fig. 16. C. trachomatis serovar I MOMP
variable domain sequence.

Fig. 17. C. trachomatis serovar J MOMP
variable domain sequence.

Fig. 18. C. trachomatis serovar K MOMP
variable domain sequence.

Fig. 19. C. trachomatis serovar L3 MOMP
variable domain sequence.

DETAILED DESCRIPTION OF THE INVENTION

All references cited below are specifically
incorporated into the specification by reference.

I. MATERIALS AND METHODS.

A. Chlamydiae.

C. trachomatis serovars A/Har-13, B/TW-5/OT,
Ba/AP-2, C/TW-3/OT, D/UW-3/Cx, E/Bour,
F/IC-Cal-13, G/UW-57/Cx, H/UW-4/Cx, I/UW-12/Ur,
J/UW-36/Cx, K/UW-31/Cx, LGV/L1-440, LGV/L2-434 and
LGV/L3-404 were employed herein and were immunotyped
by the micro-immunofluorescence (micro-IF) procedure
of Wang et al. (J. Infect. Dis. 152:791-800 (1985)).
Serovars A, E and F were obtained from the American
Type Culture Collection (ATCC, Rockville, MD). All
other serovars were obtained from Dr. Cho-Chou Kuo,
Department of Pathobiology, University of
Washington, Seattle, WA.

B. RNA extraction.

Approximately 2 x 10* HeLa 229 cells grown in 
stationary culture were each infected with one
chlamydial serovar. Infected HeLa cells were
removed from the culture flasks with trypsin 24
hours post-infection for serovars L1, L2 and L3, and
36 hours post-infection for all other serovars.
Cell suspensions were then pooled, centrifuged and
washed twice in cold (4°C) phosphate-buffered saline
(PBS; 0.02 M sodium phosphate, 0.15 M NaCl, pH 7.2).
Cells were lysed in 5 ml of cold guanidine
isothiocyanate lysis buffer (Current Protocols in
Molecular Biology, Wiley Interscience, p. 4.2.4
(1987)). The resulting suspension was then gently
refluxed through an 18 gauge needle 15-20 times to
shear the DNA.

Total RNA was extracted (2x) from the
solution using hot acidic phenol (65°C, pH 5.0),
followed by two extractions with chloroform/isoamyl
alcohol (24:1 vol/vol). The RNA from each pool was
precipitated with two volumes of cold ethanol
(-20°C). The precipitate was evaporated to dryness using
a Speed-Vac (Bachofer, Reutlingen, FRG). The RNAs
were resuspended in glass-distilled water at a
concentration of 20-40 mg/ml and stored at -70°C.

C. Synthetic oligonucleotides.

Oligonucleotides (21-mers) complementary to
the + strand MOMP DNAs flanking the 3' end of each
VD were synthesized using a SAM1 automated synthesis
instrument (Biosearch, San Rafael, CA) by the
N-methylimidazole phosphotriester method described by
Grayston et al. (J. Infect. Dis. 122:87-104 (1975))
and according to the manufacturer's instructions.
De-blocked oligonucleotides were evaporated to
dryness using a Speed-Vac, and were then dissolved
in distilled water at a concentration of 10-30 mg/ml
and stored at -20°C. Oligonucleotides were
constructed according to the published sequences of
the A, B, C and L2 serovar MOMP genes (Stephens et al.,
J. Bacteriol. 168, supra and Baehr et al., supra),
as well as the unpublished F serovar sequence which
was determined by the present inventors.
The MOMP gene sequences to which synthetic 
complementary oligonucleotides were constructed, and 
their location in the MOMP genes are shown in Fig. 
1. The oligonucleotide number, its sequence, and 
the MOMP serovar and VD for which the 
oligonucleotide was used for primer extension mRNA
sequencing are summarized below in Table 1.

TABLE 1. Oligonucleotides Used in Primer-extension
mRNA Sequencing of C. trachomatis MOMP VDs

Oligonucleotide  Oligonucleotide               Serovars and their VDs sequenced
number           sequence                      with oligonucleotide primers

BP-1             5'-ATCCCAAATATTCAATGCCAT-3'   VDIs of B,Ba,D,E,G,L1,L2
CP-1             ATCCCAGATATTTAATGCCAT         VDIs of A,C,H,I,J,K,L3
FP-1             ATCCAAATATTCAATGTCAT          VDIs of F,G
BFP-2            TCCACATTCCCACAAAGCTGC         VDIIs of B,Ba,D,E,F,G,L1,L2
CP-2             CCCACATTCCCAGAGAGCTGC         VDIIs of A,C,H,I,J,K,L3
BP-3             TCTGTAAGAGAGAGCTAAACT         VDIIIs of B,Ba,D,E,L1,L2
CP-3             AGTGAACATATTTAATCTGTA         VDIIIs of A,C,H,I,J,K,L3
FP-3             AGTGAACATATTGAGTCTGTA         VDIIIs of F,G
P-4-1            AATACCGCAAGATTTTCTAGA         VDIVs of A,B,Ba,C,D,E,F,H,I,
                                               J,K,L1,L2
P-4-2            TCCTACTGCAATACCGCAAGA         VDIVs of L3,G (ND*)
P-4-3            TTTTCTAGATTTCATCTTGTT         VDIV of J (ND*)
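
Because the primers in Table 1 are complementary to the + strand, the site each one anneals to on the mRNA is its reverse complement. The Python sketch below is illustrative only; it takes a subset of the Table 1 sequences as transcribed above, derives the + strand target site, and counts the positions at which two serogroup-specific primers for the same domain differ. The names `PRIMERS`, `reverse_complement` and `mismatches` are hypothetical.

```python
# A subset of primer sequences copied from Table 1, written 5' to 3'.
PRIMERS = {
    "BP-1": "ATCCCAAATATTCAATGCCAT",
    "CP-1": "ATCCCAGATATTTAATGCCAT",
    "BFP-2": "TCCACATTCCCACAAAGCTGC",
    "CP-2": "CCCACATTCCCAGAGAGCTGC",
    "P-4-1": "AATACCGCAAGATTTTCTAGA",
}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.translate(_COMPLEMENT)[::-1]


def mismatches(a, b):
    """Count positions at which two equal-length sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    return sum(x != y for x, y in zip(a, b))


if __name__ == "__main__":
    for name, seq in PRIMERS.items():
        print(name, len(seq), "target site:", reverse_complement(seq))
    print("BP-1 vs CP-1:", mismatches(PRIMERS["BP-1"], PRIMERS["CP-1"]))
    print("BFP-2 vs CP-2:", mismatches(PRIMERS["BFP-2"], PRIMERS["CP-2"]))
```

With these sequences the B-group and C-group VDI primers differ at two positions and the corresponding VDII primers at three, which gives a sense of how small the serogroup-specific differences at the primer sites are.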

The dideoxynucleotide chain terminating
method of Sanger et al. (Proc. Natl. Acad. Sci. USA
74:5463-5467 (1977)), modified for use with
reverse transcriptase and RNA templates as described
by Lane et al. (Proc. Natl. Acad. Sci. USA
82:6955-6959 (1985)), was used for sequence determinations.
In summary, oligonucleotides were 5' end labeled
with [32P]dATP (3,000 Ci/mmol - New England Nuclear,
Boston, MA) using T4 polynucleotide kinase (10,000
U/ml - New England Biolabs, Beverly, MA) as
described by Maniatis et al. in Molecular Cloning:
A Laboratory Manual, pp. 1-545 (Cold Spring Harbor
Laboratory, Cold Spring Harbor, NY (1982)). Labeled
oligonucleotides were isolated on Sephadex G-25
columns (Boehringer Mannheim Biochemicals,
Indianapolis, IN). The oligonucleotide primers
were then hybridized to MOMP mRNA templates in
12.5 µl reaction mixtures containing 2.5 µl of 5x
annealing buffer (250 mM Tris-HCl, 300 mM NaCl,
250 mM DTT, and 5 mM EDTA, pH 8.3), 7.5 µl of
total RNA (3-10 µg/µl) and 2.5 µl of kinased
primer (2-2.5 mM).
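
The composition of the annealing mixture can be checked with simple dilution arithmetic. The Python sketch below is illustrative only and assumes that all three component volumes are in microlitres; it confirms that they add up to the stated 12.5 total and computes the final concentration of each buffer component. The names `COMPONENTS_UL`, `BUFFER_5X_MM` and `final_concentrations` are hypothetical.

```python
# Component volumes of the annealing mixture (assumed to be microlitres).
COMPONENTS_UL = {
    "5x annealing buffer": 2.5,
    "total RNA": 7.5,
    "kinased primer": 2.5,
}

# Concentrations of the 5x annealing buffer stock in mM, as listed in the text.
BUFFER_5X_MM = {"Tris-HCl": 250, "NaCl": 300, "DTT": 250, "EDTA": 5}


def final_concentrations(stock_mm, stock_volume, total_volume):
    """Dilute each stock concentration by stock_volume / total_volume."""
    return {
        name: conc * stock_volume / total_volume
        for name, conc in stock_mm.items()
    }


TOTAL_UL = sum(COMPONENTS_UL.values())
FINAL_MM = final_concentrations(
    BUFFER_5X_MM, COMPONENTS_UL["5x annealing buffer"], TOTAL_UL
)

if __name__ == "__main__":
    print("total volume:", TOTAL_UL)
    for name, conc in FINAL_MM.items():
        print(name, conc, "mM")
```

The 2.5 in 12.5 ratio is a fivefold dilution, so the 5x stock ends up at 1x in the annealing reaction.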

The mixtures were heated at 90°C for 3
minutes, centrifuged and then allowed to cool slowly
to 45°C over a 30 minute period. 2 µl of the
hybridized mixture was then added to each of four
tubes containing 3 µl of reaction mixture (250 mM
Tris-HCl, pH 8.3, 300 mM NaCl, 50 mM DTT, 30 mM
MgCl2, 2 mM dNTPs, 0.4 mM ddNTPs and 0.6-0.8 U/µl
avian myeloblastosis virus reverse transcriptase
(20,000 U/ml - Pharmacia Fine Chemicals, Inc.,
Piscataway, N.J.)). The mixtures were incubated at
45°C for 40 minutes, then the reactions were stopped
by evaporating the tubes to dryness in a Speed-Vac.
Following evaporation, the samples were resuspended
in 3 µl of 100% formamide containing 0.8% xylene
cyanol and 0.8% bromophenol blue and heated at 95°C
for 3 minutes. The samples were then promptly
loaded on sequencing gels (8% polyacrylamide) to
achieve separation. Following separation the gels
were fixed with methanol, dried, then
autoradiographed.

II. Sequence Analysis

A. Primer extension and DNA sequencing.

Using the oligonucleotide primers described
above, the VD-encoding portions of the MOMP mRNAs
were sequenced. A representative sequencing gel is
shown in Fig. 2 wherein MOMP B-VDII and A-VDII are
shown. Oligonucleotides BFP-2 and CP-2 were
employed in this sequencing as primers. The mRNA
derived MOMP sequences depicted are identical to the
previously published genomic DNA sequences for the
MOMP B-VDII and A-VDII (Stephens et al., supra and
Baehr et al., supra), thereby demonstrating both the
feasibility and accuracy of this sequencing method.

B. Sequence analysis.

The synthetic oligonucleotides used for
sequencing the MOMP VDs of the 15 C. trachomatis
serovars are those depicted previously in Table 1.
The locations of the complementary sequences in the
MOMP genes of serovars B, L2, A, C and F are shown
in Fig. 1. All oligonucleotides were 21-mers and
were complementary to conserved nucleotide sequences
40 to 60 nucleotides downstream (3') from the VD to
be sequenced. Because inter-serogroup (but not
intra-serogroup) nucleotide variation occurs within
these regions, oligonucleotides complementary to
serogroup-specific MOMP sequences were constructed
for use in the sequencing of the variable domains of
the serovars within each serogroup. The 40 to 60
nucleotide distance between the VD and the
downstream oligonucleotide primer site was necessary
to produce high resolution sequences of the VDs.

The nucleotide and deduced amino acid
sequences of the 4 variable domains for each of the
15 C. trachomatis serovars are individually depicted
in Figs. 5-19. These sequences are also presented
comparatively in Fig. 3 (nucleotide sequences) and
Fig. 4 (deduced amino acid sequences). The B, F and
C serovars' MOMP VDs are used as prototype sequences
for comparative analyses of serovar VDs within the
B, intermediate and C serogroup, respectively. A
discussion of the results of the comparative
analyses of the nucleotide and amino acid sequences
of the four VDs for each C. trachomatis serovar is
presented below.

1. Variable Domain I. VDI contains 60
nucleotides encoding 20 amino acids for all B
serogroup serovars, 66 nucleotides encoding 22 amino
acids for serovars K, L3 and all C serogroup
serovars, and 63 nucleotides encoding 21 amino acids
for serovars F and G. The nucleotide sequences of
VDIs from B serogroup serovars are highly conserved,
with only 3 to 5 substitutions per serovar as
compared to the B serovar prototype sequence. The
deduced amino acid sequences of VDIs of B serogroup
serovars are also conserved, with only 2 to 4
substitutions per serovar, and the majority of
substitutions occur within a 12 amino acid
region (residues 68-79) that resides in the central
portion of the domain.



In contrast to the above, serovars in the C
serogroup and serovars K and L3 show greater
nucleotide variation in VDI than the B serogroup.
Compared to C VDI, the other C serogroup serovars
have 1 to 9 nucleotide substitutions in VDI that
result in 0 to 4 amino acid changes. As above, the
majority of these substitutions occur in the central
region of the domain.

VDIs of serovars F and G are identical in
their nucleotide and amino acid sequences.

2. Variable Domain II. VDII contains 63
nucleotides encoding 21 amino acids in the MOMP of
serovars D, E, L1, F and G. The VDIIs of the
remaining 10 serovars have 66 nucleotides encoding
22 amino acids. VDII shows more sequence variation
than VDI in all serovars. As compared to serovar B,
other B serogroup serovars have 2 to 19 nucleotide
substitutions resulting in 2 to 8 amino acid
changes. The least amount of variation occurs
between serovars B and Ba, in which there are two
nucleotide substitutions that each result in an
amino acid substitution. Most of the amino acid
substitutions occur within an 8 amino acid region in
the central part of the domain (residues 144 to
151).

The VDIIs of C serogroup serovars show
considerable sequence variation, having 4 to 8
nucleotide substitutions which result in 3 to 6
amino acid changes. A similar amount of variation
was found in VDII sequences between the F and G
serovars of the intermediate serogroup. Compared to
serovar F VDII sequences, serovar G VDII has 6
nucleotide substitutions resulting in 3 amino acid
changes.



3. Variable Domain III. VDIII is the
smallest and least variable of the four domains.
VDIII contains 42 nucleotides which encode 14 amino
acids. The VDIII sequences of A, H, J and L3 are
identical to each other. Serovars D, L1 and F have
identical sequences which differ from serovar B by 3
amino acid substitutions. The remaining serovars
have sequences differing from serovar B by 1 to 4
amino acid substitutions.

4. Variable Domain IV. VDIV is the largest
variable domain, containing 96 nucleotides encoding
32 amino acids in serovars A, I and all B serogroup
serovars, and 99 nucleotides encoding 33 amino acids
in the remaining serovars. The VDIV domain can be
separated into three distinct regions based on
sequence homology: (i) the N-terminal region
(residues 288-295), (ii) the central region
(residues 296-306), and (iii) the C-terminal region
(residues 307-317).

Using micro-immunofluorescence polyclonal
mouse sera, the 15 serovars have been separated into
three serogroups. Group 1, or the B serogroup,
contains serovars B, Ba, D, E, L1 and L2; group 2,
the intermediate serogroup, serovars F and G; and
group 3, the C serogroup, serovars A, C, H, I, J, K
and L3.

Within the B serogroup, serovars B and Ba
have identical sequences in the N-terminal regions,
as do serovars D and E. There are three amino acid
substitutions between serovars B-Ba and D-E.
Serovars L1 and L2 contain one and two additional
amino acid substitutions in this region,
respectively. Within the C serogroup, serovars C,
H, I, J and K have identical sequences in the
N-terminal region of VDIV. Serovar L3 contains a
single amino acid substitution. Serovar A contains
the most variable sequence in this region,
containing 4 sequence substitutions as compared to
the prototype C serovar sequence. Serovars F and G
differ in this region by two amino acids.

The central region of VDIV is the most highly
conserved sequence among the VDs of the chlamydial
serovars. A nine amino acid sequence (TTLNPTIAG) is
conserved in all serovars except serovar K, which
contains a threonine (T) instead of an alanine (A)
(residue 303). The central NPT sequence in VDIV is
present in all 15 serovars.
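
The single substitution described for serovar K can be expressed as a position-by-position comparison of two aligned peptides. The Python sketch below is illustrative only. It builds the serovar K variant by replacing the one alanine in TTLNPTIAG with threonine, and it assumes that the nonapeptide begins at residue 296, the first residue of the central region; that start position is an inference and is not stated in the text. The function names are hypothetical, and "identity" is used here as a simple stand-in for the homology figures quoted in this description.

```python
CONSERVED = "TTLNPTIAG"
# Serovar K variant: Ala -> Thr at the only alanine in the sequence.
SEROVAR_K = CONSERVED.replace("A", "T")
# Assumption: the nonapeptide starts at the first residue of the central region.
FIRST_RESIDUE = 296


def substitutions(ref, other, first_residue=1):
    """List (residue number, ref residue, other residue) for aligned peptides."""
    if len(ref) != len(other):
        raise ValueError("sequences must be aligned to equal length")
    return [
        (first_residue + i, a, b)
        for i, (a, b) in enumerate(zip(ref, other))
        if a != b
    ]


def percent_identity(ref, other):
    """Percentage of aligned positions carrying the same residue."""
    n_diff = len(substitutions(ref, other))
    return 100.0 * (len(ref) - n_diff) / len(ref)


if __name__ == "__main__":
    print(SEROVAR_K)
    print(substitutions(CONSERVED, SEROVAR_K, FIRST_RESIDUE))
    print(round(percent_identity(CONSERVED, SEROVAR_K), 1))
```

Under the stated assumption the only difference falls at residue 303, in agreement with the text, and eight of the nine positions are identical.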

The C-terminal region of VDIV is highly
conserved among the B serogroup serovars. In
contrast, this region of VDIV in C serogroup
serovars shows a significant amount of amino acid
variation, although the amino acid composition is
similar (predominantly S, T, E and A). Serovars F
and G differ by three amino acid residues in this
region.

In contrast to the historical classification 
noted above, wherein serovars K and L3 are placed 
within the C serogroup, immunotyping with monoclonal 
antibodies specific to MOMP has been recently 
employed and serovars K and L3 have been 
reclassified to the intermediate serogroup. Based 
on the present invention and comparative MOMP VD 
amino acid homology analysis, however, serovars K 
and L3 clearly belong in the historical C serogroup. 



homologies. 

The percent of amino acid homology and the
total number of amino acid substitutions for the VDs
of each serovar are shown in Table 2.

Inter-group (1, 2 and 3) VD amino acid
homology is between 33 and 46%. Intra-group VD
amino acid homology is between 74 and 94% for
groups 1 and 3, and 85% for group 2. Therefore,
classification of C. trachomatis isolates based on
MOMP VD amino acid homology correlates extremely
well with the historical serological classification
and generally with immunotyping using MOMP-specific
monoclonal antibodies (see e.g., Wang et al.,
supra). The present inventors note, however, that
recent immunotyping using monoclonal antibodies
places serovars K and L3 within the intermediate
serogroup (serovars F and G), whereas VD amino acid
homology comparison shows serovars K and L3 to be
more closely related to serovars within the C
serogroup.
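
The difference between the two classifications can be made explicit by comparing the group memberships directly. The Python sketch below is illustrative only: it encodes the monoclonal-antibody grouping (serovars K and L3 in the intermediate serogroup) and the VD homology grouping (K and L3 in the C serogroup) as given in this description, and reports which serovars change group. The dictionary and function names are hypothetical.

```python
# Grouping by monoclonal antibody immunotyping, as described in the text.
MONOCLONAL = {
    "B": {"B", "Ba", "D", "E", "L1", "L2"},
    "intermediate": {"F", "G", "K", "L3"},
    "C": {"A", "C", "H", "I", "J"},
}

# Grouping by MOMP VD amino acid homology (groups 1, 2 and 3 in the text).
VD_HOMOLOGY = {
    "B": {"B", "Ba", "D", "E", "L1", "L2"},
    "intermediate": {"F", "G"},
    "C": {"A", "C", "H", "I", "J", "K", "L3"},
}


def assignment(groups):
    """Invert {group: serovars} into {serovar: group}, rejecting duplicates."""
    result = {}
    for group, members in groups.items():
        for serovar in members:
            if serovar in result:
                raise ValueError(f"{serovar} assigned to more than one group")
            result[serovar] = group
    return result


def reassigned(first, second):
    """Return {serovar: (group in first, group in second)} where they differ."""
    a, b = assignment(first), assignment(second)
    if set(a) != set(b):
        raise ValueError("the two classifications cover different serovars")
    return {s: (a[s], b[s]) for s in sorted(a) if a[s] != b[s]}


if __name__ == "__main__":
    print(len(assignment(VD_HOMOLOGY)), "serovars classified")
    print(reassigned(MONOCLONAL, VD_HOMOLOGY))
```

Both schemes cover the same 15 serovars, and only K and L3 move between the intermediate and C serogroups.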

III. Immunogenicity based on hydrophilicity and
charge values.

The VDs are hydrophilic and charged,
properties indicating that these sequences are
associated with the chlamydial cell surface and are
potential antigenic sites. In fact, major antigenic
sites which elicit the formation of protective
serotyping antibodies have been mapped to VDI, VDII,
and VDIV of the MOMP of serovars A, B, C and L2
(Baehr et al., supra). However, the antigenic
properties of the VDs of the remaining 11 serovars
have not as yet been described.

In order to define the antigenicity of
these domains, the MOMP VDs of the chlamydial
serovars were analyzed for hydrophilicity and charge
using standard measurements and calculations. The
number of charged amino acids and the hydrophilicity
value of sequences within each MOMP VD for the 15
serovars were determined. These values are provided
in Table 3.



As the data demonstrate, the VDs have a broad range of
hydrophilicity and charge. The VDI sequences of serovars in the B
serogroup had an average hydrophilicity value of 4.47, and contained
an average of 2 positive (basic) charges and 1.3 negative (acidic)
charges, while the VDII sequences had an average hydrophilicity value
of 6.95, and contained an average of 1.35 positive charges and 3.2
negative charges. Based on these results, the VDII sequences of
serovars within the B serogroup are predicted to have the greater
immunogenic potential of the two.

Indeed, as reported by Stephens et al. (J. Exp. Med.,
supra), epitope mapping studies using antisera prepared against
synthetic peptides corresponding to MOMP VDs show that
serovar-specific determinants are located in the VDII of the MOMP of
serovars B, C and L2, and a highly conserved species-specific MOMP
determinant is mapped to VDIV as depicted in Table 4.



In contrast to the above, VDI sequences of C serogroup
serovar MOMPs were more hydrophilic and charged than VDII.
The average hydrophilicity for VDI was 5.3 with 1.4 positive
and 2.7 negative charges, while the hydrophilicity value for
VDII was -1.24 with 2 positive and 0.7 negative charges.
These data suggest that VDI sequences of C serovars have a
greater immunogenic potential than VDII sequences. This
conclusion is supported by the work of Baehr et al., supra,
who, using a lambda-gt11 epitope mapping procedure, showed that
the serovar-specific MOMP epitopes for serovars A and L2 mapped
to VDI and VDII, respectively, whereas subspecies, serogroup
and species-specific epitopes mapped to VDIV. Specifically,
the epitope recognized by monoclonal antibody A-20, which is
specific for serovar A MOMP, has been mapped to VDI (Table 4).

As with the C serogroup, the MOMP VDI sequences of the 
intermediate serovars F and G were more hydrophilic and 
charged than the VDII sequences. Several serovars (D, E and 
H) contain both VDI and VDII sequences which are extremely 
hydrophilic and charged, suggesting that in MOMPs of these 
serovars, both domains may be immunogenic. 

Although antigenic sites have not yet been mapped to 
VDIII, its hydrophilicity and charge values implicate this 
domain as a potential immunogenic site as well. 

As noted previously, VDIV has been divided into three
regions: (i) N-terminal (residues 284-295), (ii) central (residues
296-306) and (iii) C-terminal (residues 307-317). The
hydrophilicity and charge for each region have been
independently calculated.

The VDIV domain shows a consistent pattern of
hydrophilicity and charge in each region for MOMPs of all
serovars. The N-terminal region is charged but is weakly
hydrophilic; the central region is consistently uncharged and
is hydrophobic; and the C-terminal region is both highly
charged and hydrophilic, with the exception of serovars F and
G, which have only moderate charge and hydrophilicity.

The single exception in correlating hydrophilicity and
charge with known MOMP antigenic determinants is the
species-specific MOMP epitope recognized by monoclonal antibody
L21-10. This epitope is located in the central region of VDIV
(TTLNPTIAG) and is uncharged and strongly hydrophobic (-5.1).
From the epitope data, it appears that serovar-specific
determinants are located in MOMP VDI, VDII or both, while
common or serogroup-specific MOMP antigenic determinants are
located in VDIV.
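
The hydrophilicity and charge figures for the central VDIV sequence can be illustrated with a short calculation. The Python sketch below is illustrative only. The text says only that "standard measurements and calculations" were used, so the choice of the Hopp-Woods hydrophilicity scale, and the counting of K and R as positive and D and E as negative charges, are assumptions. The names `HOPP_WOODS`, `hydrophilicity` and `charges` are hypothetical.

```python
# Hopp-Woods hydrophilicity values per residue (assumed scale; the text does
# not name the scale that was used).
HOPP_WOODS = {
    "R": 3.0, "D": 3.0, "E": 3.0, "K": 3.0, "S": 0.3, "N": 0.2, "Q": 0.2,
    "G": 0.0, "P": 0.0, "T": -0.4, "A": -0.5, "H": -0.5, "C": -1.0,
    "M": -1.3, "V": -1.5, "I": -1.8, "L": -1.8, "Y": -2.3, "F": -2.5,
    "W": -3.4,
}

POSITIVE = set("KR")  # basic residues counted as positive (assumption)
NEGATIVE = set("DE")  # acidic residues counted as negative


def hydrophilicity(peptide):
    """Sum of per-residue hydrophilicity values, rounded to two decimals."""
    return round(sum(HOPP_WOODS[aa] for aa in peptide), 2)


def charges(peptide):
    """Return (number of positive residues, number of negative residues)."""
    positive = sum(aa in POSITIVE for aa in peptide)
    negative = sum(aa in NEGATIVE for aa in peptide)
    return positive, negative


if __name__ == "__main__":
    epitope = "TTLNPTIAG"
    print(epitope, hydrophilicity(epitope), charges(epitope))
```

On this scale the summed value for TTLNPTIAG is -5.1 with no charged residues, which matches the figure quoted above and suggests, without proving, that a summed Hopp-Woods value is the kind of measure intended.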



ABSTRACT OF INVENTION

The nucleotide and deduced amino acid sequences of the four
variable domains of the major outer membrane proteins of the
15 serovars of Chlamydia trachomatis are disclosed together
with sequence and immunogenic analysis of these domains.